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Abstract: In Bianco et al. (2001), the authors present the Sensor Location Problem: that of locating 
the minimum number of traffic sensors at intersections of a road network such that the traffic flow on the 
entire network can be determined. They offer a necessary and sufficient condition on the set of monitored 
nodes in order for the flow everywhere to be determined. In this paper, we present a counterexample that 
demonstrates that the condition is not actually sufficient (though it is still necessary) . Wc present a stronger 
necessary condition for flow calculability, and show that it is a sufficient condition in a large class of graphs 
in which a particular subgraph is a tree. Many typical road networks are included in this category, and we 
show how our condition can be used to inform traffic sensor placement. 
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1 Introduction 



Traffic congestion is a significant problem in most major cities in the world. An important first step in 
mitigating road congestion is to know the distribution of cars on each road of the network. This can be 
achieved by using traffic sensors to count cars traveling into and out of an intersection. However, placing 
sensors on every intersection is not only prohibitively expensive, it is also inefficient: if some sensors were 
removed, traffic flow through those intersections might still be calculated by applying flow conservation laws 
and knowledge of the fraction of cars turning in each direction at each intersection. In fact, even a vertex 
cover is inefficient for these reasons. Thus, we want to locate the minimum number of sensors such that we 
can still determine the distribution of cars in the entire network. This problem was introduced in |Bianco| 



et al. (2001) and named the Sensor Location Problem, (SLP). 



Bianco et al. (2001 ) present a necessary and sufficient condition on the set M of monitored intersections 



such that the traffic flow on the entire network is calculable. We present a counterexample demonstrating 
that the condition, while necessary, is not sufficient. Using the insights provided by the counterexample, 
we develop a stronger necessary condition that, while not sufficient in general, is sufficient in a large class 
of networks in which a particular unmonitored subgraph, to be defined in this paper, is a tree. We present 
several examples of road networks, including the standard grid network, to which this sufficient condition can 
be used to confirm that the flow can be completely specified. Moreover we present examples for which the 
condition is not sufficient, but where the failure of the necessary condition also provides useful information 
about the network. 



First, we review the terminology and notation used in Bianco et al. (2001). Then, we present our 
counterexample in Section [3j and develop a matrix representation for the problem in Section [4] Section 
[5] derives a graph-theoretic necessary condition for flow calculability, and Section [6] demonstrates that this 
condition is sufficient in the case when each unmonitored subgraph is a tree. In Section[7|we provide examples 
of how this new condition could be used for decision support by traffic engineers. We offer concluding remarks 
111 Section |8] 



2 Definitions 

Let the road network be represented by a directed graph G — (V,A), where V is a set of intersections and 
A is a set of "two-way" directed arcs (roads). That is, if u,v € V and uv G A, then vu £ A, but the traffic 
flow on arc uv need not equal that on arc vu. We represent the traffic flowing over the roads by a network 
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flow function / : A — >• M that satisfies the flow conservation law at each vertex v E V: 



where w~ is the set of arcs with head at u, w + is the set of arcs with tail at v, and S v is the balancing flow 
at vertex v. The sources and sinks of traffic, called centroids, are the vertices with non-zero balancing flows; 
the set of all such centroids we denote B. Because flow is conserved at each vertex, we have J2 v ev ^ v = ^- 
We assume that while the set B is known, the values of the balancing flows for vertices in B are unknown. 

To determine the network flow function /, sensors are placed at various intersections in the road network. 
We denote the set of monitored vertices by M. If an intersection is monitored, then the number of cars 
entering and leaving the intersection along each road connected to the intersection is revealed. We denote 
the set of vertices directly adjacent to vertices in M via an arc in A as A(M). 

We finally assume knowledge of the turning ratios at every intersection in the network. The turning 
ratio c vu for arc vu at vertex v is simply the percent of incoming traffic to v that leaves along arc vu. That 
is, 

fvu C-vu ^ ^ fe- (2) 

Define the turning factor of arc vu with respect to given reference arc vw to be the ratio of their turning 
ratios: 

a vu — . (6) 
Then we can write the flow /„„ of any outgoing arc vu from v in terms of f vw as 

fvu — ^vufvw- (4) 

The values for the turning ratios can be obtained from historical data about traffic patterns if available, or 
can be determined easily by monitoring existing traffic patterns for a short time. 

When a set M of vertices is monitored, the flow on all arcs between vertices in M and between M and 
A(M) are known, as well as the balancing flows at each centroid in M. Applying the turning ratios, we also 
know the flow on all arcs between vertices in A(M). We call the set of arcs connecting vertices in M and 
A(M), on which the flow can be computed directly from monitoring M and applying turning ratios, the 
combined cutset of M: 
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Definition 2.1 ( Bianco et al.| ( 2001 )). The combined cutset of M, Cm, is the set of arcs in the subgraph 



of G induced by M U A(M). 

As an aside, we can also use the turning ratios to determine outgoing flow from vertices in A(M) to vertices 
neither in A(M) nor M; these arcs are not part of the combined cutset, but will be used later. 
We are now ready to define the Sensor Location Problem (SLP): 



Definition 2.2 (Sensor Location Problem, Bianco et al. (2001 1). Given a two-way directed graph G — (V, A), 



a network flow function f and a set of centroids B, what is the smallest set M of monitored vertices such 
that knowledge of all turning ratios, the values of f on incoming and outgoing arcs of M and balancing flows 
S v on M uniquely determines f and the balancing flows S v everywhere on G? 

We focus on the verification version of SLP and seek a condition to verify that a proposed set M uniquely 
determines / and the balancing flows. 

3 A proposed condition and counterexample 

We see from the definition of the combined cutset of M that these are arcs over which the problem of 
determining the flow has already been solved directly from monitoring. Thus, we can remove Cm from the 
graph, and try to use the remaining flow coming out of A[M), turning ratios and flow balance equations to 
determine the flow everywhere else in the graph. We therefore define the unmonitored subgraph of G to 
be the subgraph G that remains when Cm has been removed from the graph: G = (V — M, A — Cm)- This 
subgraph contains all arcs over which the flow is not completely determined by monitoring. 

The unmonitored subgraph is often, but not always, disconnected. We call the i th connected component 
of the unmonitored subgraph the i th unmonitored component and label it G i . We label the set of 
centroids in that component B iy and the set of (originally) adjacent vertices in that component A^M). 



Bianco et al. ( 2001[ ) present a proof of the following condition on the set M in order for the flow function / 



to be uniquely determined. While this is a necessary condition, we present an example that demonstrates it 
is not actually sufficient in general. 



Theorem 3.1 (Bianco et al. (2001)). Given a set of monitored vertices M, the flow on a digraph G can be 



uniquely determined everywhere if and only if for every unmonitored component Gi of G, 

\B t \ < \Ai(M)\. 
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In their proof of this theorem, the authors compare the number of unknown arc and balancing flow 
variables to the number of flow balance and turning ratio equations when this condition holds. They argue 
(correctly) that the number of equations must be at least the number of unknowns, which happens only if 
\Bi\ < \Ai(M)\. However, in their argument that the condition is sufficient, they neglect the possibility that 
some of the resulting equations might be linearly dependent, and thus the solution will not be unique. 

To see this, consider the following example (shown in Figure [l]). Let 5 + (u) be the outgoing degree of 
vertex u, and suppose that the turning ratios c uv = \/5 + {u) for all arcs uv (the flows on all outgoing arcs 
from vertex u are equal). By monitoring vertex a, the unmonitored subgraph G induced by removing the 
combined cutset has only a single connected component, consisting of the vertices b,c,d,e,f and the arcs 
between them. A(M) in this component is {b,d}, and B — M = {e,f}. Thus \A(M)\ = \B- M\ = 2, and 
by Theorem |3.1[ we should be able to determine / and the vector S of balancing flows uniquely. 



Fi gure li A counterexample to the flow calculation theorem (Theorem |3.1 [ )■ If wc monitor vertex a in the above graph, the 
graph with cutset Cm = {ab,ba, ad, da} removed satisfies the conditions in Theorem |3.1| However, we cannot calculate / e£ j or 
ffd from the known information. 

However, suppose we observe 4 units of flow along arcs ab, ba, ad, and da. We apply the flow balance 
equation and knowledge of the turning ratios sequentially at each vertex until we get stuck. Consider vertex 
b. It is not a centroid, so Sb = 0, and since flows on all outgoing arcs are equal, fb c = fba = 4. To preserve 
balance of flow, f c t, = 4 as well. By a similar logic, we obtain f c( i = fd c = 4 at vertex c and fd e — fdf — 4 at 
vertex d. We cannot determine f ec i and ffd because both e and / are centroids, and their balancing flows 




S,= 7 



■5 



are unknown. Balancing flows in the network must sum to zero, so Sf = ~S e , leaving us with the following 
system of equations having three unknowns and three equations, as predicted by Theorem 3.f 



fed 
fed 



ffd 



ffd 



S e = 4 

S e = 4 



(5) 



Notice, however, that these equations are linearly dependent and thus fail to admit a unique solution. 



Therefore, the condition provided in Theorem 3.1 is not sufficient. 

Unfortunately, there are many such counterexamples, including cases when the graph is a tree or when 



the inequality in the theorem is strict. Fortunately, the subsequent work of Bianco et al. (2001) and Bianco 



et al. (20061 is correct despite the erroneous Theorem 3.1 Nonetheless, it is still valuable to understand why 



the theorem is incorrect and to formulate a new theorem that guarantees the calculability of traffic flows on 
a monitored graph. To better understand the circumstances under which Theorem 1 3 . 1 1 f ails , we next examine 
the problem via the graph's incidence matrix. 



4 SLP and Invertible Matrices 

Let E be the \V\ x \A\ incidence matrix where the (u, e) th entry is —1 if vertex u is the tail of arc e, 1 if it 
is e's head, and if e is not incident to u. Let f be the |yl|-length vector of unknown arc flows and S the 
|y|-length vector of balancing flows. The system of linear flow conservation constraints at each vertex then 
takes the form 

Ef + S = x, (6) 

where x = 0. Notice that the sum of these equations yields the balancing flow constraint ^2 ueV S n = 0, so 
we do not need to add this constraint to system pj. 

This system does not include the known turning ratios or the observed flow along monitored arcs, and 
thus contains more unknown variables than are necessary. We can therefore reduce this system of equations 
to a more compact representation, as follows: 

1. For each vertex u £ V, we designate an arbitrary outgoing arc e u to be the canonical arc for vertex 
u. Since we know the turning ratios of the graph, the flow over any arc uv is /„„ = a uv f eu . This 
reduces the number of flow variables from \A\ to |V|, and we can modify the unknown flow vector f to 
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include only the \V\ canonical arcs. 



2. Having expressed the flow on any arc uv in terms of the flow on e u , the flow balance matrix E collapses 
into a square matrix E, where row u still corresponds to the balance equation at vertex it, and column 
v corresponds to the canonical arc for vertex v, e v . The (u,v) th entry of E is given by 



if it and v are connected 



Eu, adj 



acent to 



a,.,,, if u = v 



if it and v are not connected 



3. We also augment E with \B\ columns for the unknown balancing flows at the centroids. The column 
corresponding to the centroid at vertex u has a 1 in the u th row and O's everywhere else. Likewise, 

f 

we create a single (|V| + |B|)-length vector g = of unknown canonical arc and balancing flows. 

S 

Equation ^ then becomes 

Eg = x, (7) 

where x is still the zero vector. 
We next incorporate the known flow values obtained by monitoring vertices in M. 

4. For each vertex m € M, the flow along m's canonical arc and the balancing flow (if m is a centroid) 
are known. We can remove row m from the matrix E. We also remove column m, corresponding to 
vertex m's canonical arc. Next, we update the right-hand side vector x with the known flow values by 
subtracting f Bm times the removed m th column from x. This is equivalent to subracting a mu f em from 
the u th entry of x for each vertex u adjacent to m. If m is a centroid, we also remove the column of E 
corresponding to its balancing flow. We likewise remove the entry from g corresponding to / 6m (and 
S m if m is a centroid), and remove the m th entry from x. 

5. For each vertex a G A(M), the outgoing flow from a to any vertex m £ M is monitored, so by turning 
ratios, we can deduce the flow over a's canonical arc. We therefore remove column a from E and subtract 
f Ea = -^—fam times column a from the right-hand side vector x. This is equivalent to subtracting 
a au f ea from the u th entry of x for each vertex u adjacent to a and adding J2 W adjacent to a a awfe a to 
the a th entry of x. We remove the entry from g corresponding to f ea . 
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Figure 2: A network in which the set of centroids is B = {b, d, e, /}, and vertex e is monitored, revealing the flows indicated 
on the arcs into and out of e. In this case, we can calculate the flow everywhere on the graph, as demonstrated by equation |9jl 
having a unique solution. 



We name the resulting coefficient matrix for the system of equations the flow calculation matrix F 

and rewrite for the last time our original system of equations 

Fg = x. (8) 

If equation (Tsl) has a unique solution (which occurs when the columns of F are linearly independent), 
then we can uniquely determine the flow everywhere on the graph. 



For example, consider the graph in Figure [2j with M = {e} and flows on monitored arcs as indicated in 
the figure. We choose arcs ab, ba, ca, db, ed and fb to be our canonical representatives for each vertex. We 
also assume that all turning ratios are equal except at vertex e (so a uv — 1 for all uv ^ e), where monitoring 
has revealed the turning factors to be a e t = 2 and a ec = 1. The corresponding reduced system of equations 
is: 
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It is easy to check that rank(F) = 5, and thus the columns are linearly independent; this implies that 
equation (|8| is solvable for the graph in Figure [2j 



5 A new necessary condition 

The flow is uniquely calculable if and only if the matrix F has full column rank. An obvious necessary (but 
insufficient) condition is for F to have at least as many rows as columns. F has |V| — \M\ — \A(M)\ + \B — M\ 
columns and \V\ — \M\ rows. Therefore, we require \B — M\ < |j4(M)|. The necessary condition proved in 



Bianco et al. (20011 is stronger: \{B — M)j| < |A(M)j| for all connected components i in the unmonitored 
subgraph induced by removing the arcs in the combined cutset. In fact, we can prove an even stronger 
necessary condition that relies solely on the topology of the graph. This condition correctly identifies that 
the example of Figure [l] will not yield a unique solution, whereas the original condition \{B — M)j| < |A(M)j| 
could not predict this. 

Our difficulty in calculating the flow arose when we reached vertex d. Although we knew the flow exiting 
vertex d along the arcs toward e and /, we were unable to determine the flow entering d because both e and 
/ were centroids, contributing unknown balancing flows. Traffic originating or terminating at vertices e and 
/ got "mixed up" at vertex d and could not be uniquely differentiated. 



This observation leads us to define a B-path, which we use to correct Theorem 3.1 



Definition 5.1. A B-path is a path starting at a centroid and ending at a vertex in A(M). 



This is a similar, but less restrictive, definition than that given for MB-feasible paths in Bianco et al. ( 2006 1 . 
Using this definition, we present the following theorem, which provides a stronger necessary condition for 
flow calculability. 

Theorem 5.2 (Statement A). Let G — (V,A) be a two-way directed graph with centroid set B, and let M 
be a set of monitored vertices. The flow on arcs in G and the balancing flow at the vertices in B can be 
uniquely determined everywhere only if there exists a set V of \B — M\ vertex disjoint B -paths. 
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Figure 3: The graph from Figure together with a set of B-paths. However, any two B-paths must pass through vertex d, 
so there is no set of \B — M\ disjoint B-paths associated with M. 

This is a stronger necessary condition than that given in Theorem |3 . 1 1 because it is not satisfied by our 
counterexample in Figure [l] We see in Figure [3] that any set of two -B-paths will be forced to intersect at 
vertex d. Thus, the number of disjoint B-paths is smaller than \B — M\ and we are unable to calculate the 
flow. 

To prove Theorem |5.2[ we must translate its statement related to the topological structure of the network 
into our algebraic framework described earlier. We note first that the number of vertex-disjoint B-paths 
cannot be larger than the size of a minimum disconnecting set C between B — M and A(M), by Menger's 
theorem. Thus, we require |C| > \B — M\. (In fact, the size of the minimum disconnecting set will never 
strictly exceed \B — M\). 

Next, we partition the graph G into its unmonitorcd components by removing the combined cutset. If u 
and v are in different partitions in the graph, then there was no path from u to v in G except through M or 
along an a\a^ edge for some eti and a-i G A(M). Because all rows and columns corresponding to M and all 
columns corresponding to A(M) have been removed from the matrix, vertex it's flow balance equation will 
not include any e v or S v terms, and e u and S u will not appear in vertex v's flow balance equation. Thus, we 
can rearrange the flow calculation matrix F into block form by collecting rows and columns corresponding 
to vertices in each unmonitored component, and prove the theorem for each component independently. We 
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rephrase our original theorem accordingly: 



Theorem 5.2 (Statement B). Let G,B, and M be as in Theorem 5.2 (Statement A), with the graph 
partitioned into unmonitored components and the flow calculation matrix partitioned into blocks as described. 
For each unmonitored component i, let Ci be the minimum vertex cut between (B — M)i and A(M)i, (If 
(B — M)i is empty, then let C — ). rank(F l ) = ^{columns of F 1 } (and hence the flow on G i is calculable) 
only if\Ci\ = \{B-M) t \. 



Proof. For ease of notation, we drop the subscript i and henceforth refer to all sets in the context of a given 
unmonitored component i. We assume the component contains at least one centroid, otherwise the theorem 
is true trivially because both B—M and C are empty. Within component i, we call Vm the set of vertices that 
are not in M or A(M) and are connected to M by some path that does not pass through C (i.e. they are on 
the M side of the cut C) . Similarly, we call Vb the set of vertices not in B — M that are on the B — M side of 
the cut. Note that C, A(M), and B — M could all overlap, as shown in Figure [lj we label these intersections 
as shown, where X A ( M)tC = (A(M) n C)\(B - M), X A ( M) = A(M)\(C U (B - M)), etc. Note that since 
C is by definition a vertex cut between B — M and A(M), the set X A (m) x b-m = (M M ) D (B - M))\C is 
empty. 




Figure 4: The partition of the vertex set for Theorem |5.2| Vm is the set of unaccounted-for vertices on the M side of the cut, 
and Vb is the set of unaccounted-for vertices on the B — M side of the cut. Bold arrows indicate possible connections between 
sets. Some of these sets may be empty — in particular, note that by definition, there can be no vertices in ((B — M) n A(M)) \ C, 
since C must separate B — M and A(M). The shaded-in regions correspond to the columns included in the submatrix F*. 



Let us consider a submatrix F* of F that contains only the columns corresponding to canonical arcs for 
vertices in Xb-m an d Vb and to balancing flows at vertices in Xb-m, Xc,b-m and X A (m).c.b-m ■ These 
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are the shaded regions of Figure [4j Since F has linearly independent columns, F* has full column rank, and 



K <R-Z, (10) 

where K is the number of columns of F*, R the number of rows and Z the number of zero rows. By 
construction, 

K=\X B -m\ + \V b \ + \B-M\ 

and 

R = \ X A(M)\ + \Xa(m),c \ + \Xa(m),c.b-m\ + \Xb-m\ + 
\X c \ + \Xc,b-m\ + \Vm\ + \V b \. 

Next we determine Z. As we see in Figure|4j there are no arcs from vertices in Vm or Xa(M) to vertices in 
Xb—m or Vb by definition of the cut C . Moreover, vertices in Vm and Xa<m) are n °t centroids. Therefore, 
the rows in F* corresponding to vertices in Vm or Xa(m) are all zero, and Z > \Vm\ + |-^.a(m)I- Applying 



inequality (10) and canceling common terms, we see that \B — M\ < \Xc\ + \Xa(m),c\ + \Xc,b-m\ 



\Xa(m),C,b-m\ = | C | - Because \C\ can never exceed \B — M\, we have \C\ = \B — M\ 



As an example, we walk through the construction of the F*-matrix for our original counterexample of 
Figure [T] We start with the flow calculation matrix F in the system of equations Fg = x: 
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When we remove the cutset associated with monitored vertex a, the minimum vertex cut between A(M) = 
{b, d} and B — M = {e, /} is C — {d}. \C\ ^ \B — M\, so we should not be able to calculate the flow. We 
generate the F* submatrix using the sets Xa(m) = {b}, Vm — {c}, Xa(m),c — {d}, and Xb-m = { e >/} : 
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(^) Monitored Vertex 



Centroid Vertex 




Figure 5: In this graph, monitoring vertex e creates 2 disjoint B-paths, satisfying the conditions of Theorem |5.2| However, 
the F matrix does not have linearly independent columns, and thus we cannot calculate the flow on the graph. 
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0-1 1 J 

Notice that the first two rows of the matrix are 0, which means that the rank of this submatrix can't be any 
higher than 3. This implies that the rank of F cannot equal the number of columns, and thus the flow on 
the graph cannot be calculated. 



6 A sufficient condition for trees 

Next we turn to the question of sufficiency. Unfortunately, the condition is not sufficient for graphs in general, 
but is sufficient in the case of networks whose unmonitored components are all trees. Figure [5] provides an 
example of a general graph in which there are \B — M\ vertex-disjoint B-paths, but the matrix Em still does 
not have linearly independent columns and the flow on the graph cannot be calculated. We see that the 
unmonitored subgraph of this example, which we obtain by removing M's combined cutset (in this case, the 
arcs ce, ec, de, and ed), is not a tree. However, the following theorem states that as long as the unmonitored 
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components of a graph are all trees, our condition is sufficient to guarantee the calculability of traffic flow. 



Theorem 6.1. Let G,B, and M be as in Theorem \5.2\ with the flow calculation matrix partitioned into 
blocks as described. For each unmonitored component i, let Ci be the minimum vertex cut between (B — M)i 
and A{M)i. If the i th component is a tree, then rank(F J ) = ^{columns o/F 1 } (that is, the flow on block i 
is calculable) if and only if |Cj| = \(B — M)j|. 

Prior to proving this theorem, we first prove the following lemma: 

Lemma 6.2. Let G be a two-way directed tree with known turning ratios, containing no centroids and having 
root vertex r. Suppose that G is attached at r to a graph G at vertex v with known flow value f vr . Then 
f rv — f vr and the flow on G can be determined. 

Proof. We prove this by induction on the number n of vertices in G. As a base case, suppose n = 1, then G 
contains only the leaf node r. Because r is not a centroid, its balancing flow is zero, so f rv = f vr , and the 
flow on G has been determined. 

Suppose the statement is true for any tree of size strictly less than n and let G have size n. Consider 
the vertex r € G which is attached to graph G at vertex v. Because G is a tree with no centroids, flow is 
conserved, so f rv = f vr , and all other outgoing flows of r can be determined using turning ratios. Moreover, r 
is connected to deg(r) subtrees of G each of size strictly less than n and having root vertices Uj, i — l...deg(r) 
with known incoming flow values frvt- So the flow on each subtree can also be determined, and f ViT — fm- 
We have therefore found the flow on the entire graph. ■ 

Thus, on a subtree with no centroids, the flow can be computed knowing only a single incoming arc to 
the tree. We now prove Theorem |6.1| 



Proof. Theorem 5.2 handles the necessary condition. Let n = \(B — M)<|. Because |Cj| = \(B — M)j|, 
|A(M)j| must also be at least n, and there exists a pairing between a subset of A(M)i and the vertices in 
(B — M)i such that the set of paths between all pairs a, € A(M)i and 6,; £ (B — M)j are vertex disjoint. 
(If |A(M)j| > n, then the "extra" vertices in A(M)i will act like centroids with known balancing flow equal 
to the incoming flow from M and can be treated like any other non-centroid.) We will induct on n to show 
how to propagate the flow calculation through the graph. 

If n = 0, then our partition consists of a tree having no centroids that, in the original graph, is connected 
to a vertex in M . The flow along an incoming arc to G i from M is known due to monitoring, so our 



Lemma 6.2 tells us that the flow along every arc in G i can be determined. 
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m 



T 2 




b 



Fi gure 6i The tree structure induced by the pairing of centroids and adjacent vertices in Theorem |6.l| Note that no path 
from a vertex a,j S A(M) to its pair bj £ B — M can cross any other vertex in A(M). This allows us to treat each subtree 
separately when calculating the flow along it. 

Suppose the theorem is true for any partition having \ {B — M)i\ < n unmonitored centroids, and consider 
a partition having \{B — M)j\ — n. Without loss of generality, consider centroid b\ and its matching adjacent 
vertex a±. Let Tj, T<j efl ( 0l ) be the subtrees of T rooted at a\S neighbors t\, tdeg(ai); an d assume that 
T\ U {ai} is the tree containing the (a 1; b\) pair (See Figure [6]). Every subtree maintains the original pairing 
of vertices in A{M)i and (B — M), because these pairings corresponded to vertex-disjoint paths which could 
not pass through a±. Moreover, each subtree Tj,j ^ 1 has strictly fewer than n such pairings and satisfies 
the induction hypothesis. The flow on these subtrees can be calculated. It remains to determine the flow on 
the edges between oi and tf f ai u is an outgoing flow from a\ which is known by monitoring and application 
of turning ratios. ft jai can be expressed in terms of the flow on tj's canonical edge. Thus, there is still a 
unique solution for the flow on Tj U {ai}, and all incoming flow to a\ from trees T2, T^ eg / ai y has been 
determined. 

Next we consider the tree T\ U {ai}, by propagating flow calculations along the path from a\ to b±. The 
outgoing flow from a± along the path is known by applying turning ratios. If a\ =61, then in fact T\ is 
empty. a\s incoming and outgoing flows can be used to find its balancing flow, and the flow on G i has been 
completely determined. If a\ 7^ 61, the incoming edge to ai from the next vertex on the path to b\ is the only 
edge incident to a\ whose flow has not yet been determined; it can be determined by flow conservation. By 
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a similar logic, we can propagate these flow calculations along the path from a\ to b\ until we reach the first 
vertex w having degree greater than 2. All outgoing flows of w are known by applying turning ratios to the 
known outgoing flow from w to the vertex preceding w in the path. Each branch of w that does not contain 
b\ is once again a subtree that maintains the (strictly fewer than n) original (B — M)i and A(M)i pairings. 
By our induction hypothesis, the flow on this branch can be determined and by the same reasoning as above, 
the flow on the edges between w and the branch can also be determined. Flow conservation determines the 
incoming flow to w from the next vertex on the path to b\. We continue this way until vertex b\ is reached. 
All outgoing flows from bi are determined by applying turning ratios to the known outgoing flow from b\ 
into the vertex preceding bi in the path. The flow on branches stemming from b\ can be determined by our 
induction hypothesis, and the balancing flow at b\ is simply the difference between all outgoing and incoming 
flows at b\. The flow on the tree has been calculated. ■ 

An obvious corollary is the following: 

Corollary 6.3. If G is a two-way directed tree with centroid set B and monitored vertex set M , the flow 
on G can be calculated if and only if there exist at least \B — M\ vertex- disjoint paths between A(M) and 
B - M. 

This suggests that it is the presence of cycles in unmonitored subgraphs that can occasionally lead to 
difficulties in uniquely determining the flow. 



7 Applications to traffic sensor placement 

While it might at first seem restrictive to require the unmonitored subgraph to be a tree in order to guarantee 
flow calculability, we show in this section that this is not the case. In fact, a broad collection of road networks 
can have trees as unmonitored subgraphs. Moreover, even for those networks whose unmonitored subgraphs 
contain cycles, our condition can still provide useful information about the placement of traffic sensors. 



Consider the traffic network shown in Figure 7(a) This is a traditional grid network found in many 



cities, and it has twenty-five intersections, of which seven are considered to be centroids. A traffic planner 
might be interested in monitoring four intersections on the network to calculate the traffic flow throughout 



it. If she places the monitors on the four vertices shaded in Figure 7(a) then the unmonitored subgraph 



is as shown in Figure [7(b)| The condition of Theorem 3.1 that \B — M\i < \A(M)i\ is satisfied on both 



unmonitored components, but the rightmost component violates our necessary condition of Theorem 5.2 
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(b) Unmonitored subgraph obtained by removing 
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Figure 7: A road network with four monitored vertices (shaded) and seven centroid vertices (bold). The 
unmonitored subgraph has two components. The leftmost component has three centroids and five vertices 
in A(M), satisfying the necessary condition of Theorem |3.1| All centroids have a corresponding B-path, but 
because this unmonitored component is not a tree, we cannot be certain that the flow is calculable on this 
region of the network. The rightmost component has four centroids and five vertices in A(M), satisfying 



the necessary condition of Theorem 3.1 However, the centroid labeled X does not have its own B-path in 
this unmonitored component, and hence the traffic flow is not calculable in this region of the network, by 
Theorem [5^2] (Statement A). 



(Statement A) because the centroid marked by X does not have its own B-path. Therefore, we are able 
to conclude that flow is not calculable in this region of the network. The leftmost unmonitored component 
satisfies our necessary condition, but because this component is not a tree, we are unable to conclude that 
the flow is calculable in this region. 

If the traffic planner simply rearranges two of the monitored vertices, as shown in Figure [HJ she is easily 
able to construct a network whose unmonitored subgraph is a tree. The sufficient condition of Theorem |6.1| 
is satisfied, and she can conclude that by placing the monitors in this orientation, the traffic flow on the 
network will be completely calculable. 

It is worth pointing out that unmonitored subgraphs might be trees even on very large graphs with only 
a small fraction of monitored vertices. We considered an 18 x 18 grid network (324 vertices) on which 72 
vertices were monitored and all unmonitored subgraphs were trees satisfying the sufficient condition for flow 



calculability given by Theorem 6.1 More generally, it could be expanded to a 3fc x 3k graph for any integer 
k having 12fc monitored vertices. Many more examples of large traffic networks can be found for which 
Theorem 6.1 applies. And as we have seen in Figure [7J even when the unmonitored subgraph is not a tree, 
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combined cutset Cm 



Figure 8: A road network with four monitored vertices (shaded) and seven centroid vertices (bold). This 
network is identical to that of Figure [7] except that two of the monitored vertices have changed position. 
In this modified graph, the unmonitored subgraph is a tree, and every centroid has its own B-path. Thus, 
Theorem |6 . 1| guarantees the traffic flow is fully calculable on this network. 

failure to satisfy the necessary condition of Theorem |5.2| signals the need either to increase the number of 
sensors, or to rearrange their positions until the flow is calculable. 

8 Conclusions 

We have corrected an error in an earlier theorem regarding when a proposed set of vertices M in a two- 
way directed graph is a valid monitoring set for determining the flow on the entire graph. The topological 
insights of our counterexample led us to prove a new, stronger, necessary condition that is also sufficient 
on any unmonitored subgraph of the network that is a tree. We then showed by example the broad array 
of networks in which this condition could be used by traffic engineers to determine the placement of traffic 
sensors. 
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